Institute of Statistics Mimeo Series #2567 Cladistic Grouping of Haplotypes in Association Analysis
نویسنده
چکیده
Haplotypes represent underlying polymorphisms more than single SNPs, and are considered as a more informative format of data in association analysis. To model haplotypes, it requires high degrees of freedom, which could decrease power and limit a model’s capacity to incorporate other complex effects such as interactions. Even within haplotype blocks, high degrees of freedom are still a concern unless one chooses to discard rare haplotypes. To increase the efficiency and power of haplotype analysis, we adapt the concepts of cladistic analysis and propose a grouping algorithm to cluster rare haplotypes to the corresponding ancestral haplotypes. The algorithm determines the cluster bases by preserving common haplotypes using a criterion built on the Shannon information content. Each haplotype is then assigned to its appropriate clusters probabilistically according to the cladistic relationship. Through this algorithm, we perform association analysis based on groups of haplotypes. Simulation results indicate power increases for performing tests on the haplotype clusters when compared to tests using original haplotypes or the truncated haplotype distribution.
منابع مشابه
Separating population structure from population history: a cladistic analysis of the geographical distribution of mitochondrial DNA haplotypes in the tiger salamander, Ambystoma tigrinum.
Nonrandom associations of alleles or haplotypes with geographical location can arise from restricted gene flow, historical events (fragmentation, range expansion, colonization), or any mixture of these factors. In this paper, we show how a nested cladistic analysis of geographical distances can be used to test the null hypothesis of no geographical association of haplotypes, test the hypothesis...
متن کاملIncorporating Single-Locus Tests into Haplotype Cladistic Analysis in Case-Control Studies
In case-control studies, genetic associations for complex diseases may be probed either with single-locus tests or with haplotype-based tests. Although there are different views on the relative merits and preferences of the two test strategies, haplotype-based analyses are generally believed to be more powerful to detect genes with modest effects. However, a main drawback of haplotype-based ass...
متن کاملEvolutionary-based association analysis using haplotype data.
Association studies, both family-based and population-based, can be powerful means of detecting disease-liability alleles. To increase the information of the test, various researchers have proposed targeting haplotypes. The larger number of haplotypes, however, relative to alleles at individual loci, could decrease power because of the additional degrees of freedom required for the test. An opt...
متن کاملStrong Association of CTLA-4 Variation (CT60A/G) and CTLA-4 Haplotypes with Predisposition of Iranians to Head and Neck Cancer
Background: Variations in Cytotoxic T Lymphocyte Antigen-4 (CTLA-4) affect the expression and function of this protein. Objective: We aimed to investigate the association of +49 A/G (rs231775), +1822 C/T (rs231779) and +6230 A/G (CT60, rs3087243) genetic variations, as well as the merged haplotypes in CTLA-4 gene with susceptibility to, or progression of head and neck cancer. Methods: Eighty pa...
متن کاملCladistic analysis of genotype data-application to GAW15 Problem 3
Given the increasing size of modern genetic data sets and, in particular, the move towards genome-wide studies, there is merit in considering analyses that gain computational efficiency by being more heuristic in nature. With this in mind, we present results of cladistic analyses methods on the Genetic Analysis Workshop 15 Problem 3 simulated data (answers known). Our analysis attempts to captu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006